An Empirical Comparison of Unknown Word Prediction Methods

نویسندگان

  • Kostadin Cholakov
  • Gertjan van Noord
  • Valia Kordoni
  • Yi Zhang
چکیده

We compare two types of methods which deal with unknown words in the context of computational grammars. Methods of the first type are based on the idea of supertagging and use a tagger to predict lexical descriptions for unknown tokens in a given input. The second type of methods perform lexical acquisition (LA) which, in the context of this paper, refers to the automatic acquisition of new lexical entries for the lexicon of a given grammar. The methods are compared based on the effect their application has on the parsing coverage and accuracy of the GG grammar of German (Crysmann, 2003). In particular, we adapt the LA method of Cholakov and van Noord (2010) which was originally developed for the Dutch Alpino system to be used with the GG. Its impact on coverage and accuracy on a test corpus of German newspaper texts is compared to the results reported previously on the same corpus for methods which employed a tagger. Furthermore, in a smaller experiment, we show that the linguistic knowledge this LA method provides can also be used for sentence realisation.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Bubble Pressure Prediction of Reservoir Fluids using Artificial Neural Network and Support Vector Machine

Bubble point pressure is an important parameter in equilibrium calculations of reservoir fluids and having other applications in reservoir engineering. In this work, an artificial neural network (ANN) and a least square support vector machine (LS-SVM) have been used to predict the bubble point pressure of reservoir fluids. Also, the accuracy of the models have been compared to two-equation stat...

متن کامل

ارزیابی و مقایسه روش‌های نیمه‌تجربی و عددی در پیش‌بینی مشخصات امواج بنادر امیرآباد و بوشهر

In this article, the accuracy of empirical and numerical methods in prediction of wind wave characteristics in AmirAbad and Bushehr Ports have been studied, evaluated and compared. First, the wave characteristics have been calculated using Babolsar and Bushehr synoptic stations and employing empirical methods including SMB, SPM and CEM. Moreover the errors of empirical methods have been determi...

متن کامل

تئوری رژیم و کاربرد آن برای جریان‌های یک‌نواخت و غیر یک‌نواخت

Suitable stable channel design and optimization of river geometry can reduce cost of projects. The regime theory provides the possibility of empirical and semi-empirical investigations of stable channel design in which erosion and sediment transport are in equilibrium. The objective of this research is an investigation and a comparison of the influence of uniform and non-uniform flows on the pr...

متن کامل

Estimation of Monthly Mean Daily Global Solar Radiation in Tabriz Using Empirical Models and Artificial Neural Networks

Precise knowledge ofthe amount of global solar radiation plays an important role in designing solar energy systems. In this study, by using 22-year meteorologicaldata, 19 empirical models were tested for prediction of the monthly mean daily global solar radiation in Tabriz. In addition, various Artificial Neural Network (ANN) models were designed for comparison with empirical models. For this p...

متن کامل

تئوری رژیم و کاربرد آن برای جریان‌های یک‌نواخت و غیر یک‌نواخت

Suitable stable channel design and optimization of river geometry can reduce cost of projects. The regime theory provides the possibility of empirical and semi-empirical investigations of stable channel design in which erosion and sediment transport are in equilibrium. The objective of this research is an investigation and a comparison of the influence of uniform and non-uniform flows on the pr...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2011